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CANCELLATION FOR THE MULTILINEAR HILBERT TRANSFORM 


TERENCE TAG 


Abstract. For any natural number k, consider the fc-linear Hilbert transform 

r 

Hk{fi,---,fk){x) :=p.v. fi{x + t)...fk{x + kt) — 

Jr * 

for test functions fi,..., fk ■ ^ ^ C. It is conjectured that Hk maps (K.) x • • • x 

LP’‘ (K) —> LP{W) whenever 1 < pi,... ,pk,p < oo and ^ ^ ‘ ‘ This is proven 

for k = 1,2, but remains open for larger k. 

In this paper, we consider the truncated operators 

J ' dtf 

fi{x + t)...fk{x + kt) — 

for R > r > 0. The above conjecture is equivalent to the uniform boundedness of 
||J?fe.r,R||LPi(R)x' 'XL!’fc(R)->-LP(R) in Rj whereas the Minkowski and Holder inequalities 
give the trivial upper bound of 2 log -p for this quantity. By using the arithmetic 
regularity and counting lemmas of Green and the author, we improve the trivial upper 
bound on ||i?fc,r,R||LPi(R)x-xLPfc(R)^LP(R) slightly to o(logf) in the limit co for 

any admissible choice of k and pi,... ,pk,p. This establishes some cancellation in 
the fc-linear Hilbert transform Hk, but not enough to establish its boundedness in 
spaces. 


1. Introduction 


For any natural number k and test functions fi,..., fk : M ^ C, define the Ulinear 
Hilbert transform Hk{fi ,..., fk) : M ^ C by the formula 

r df 

Hk{fi,---,fk){x) :=p.v. fi{x + t)...fk{x + kt) —, 

Jr I 


or more explicitly 


Hk{fi,...,fk)ix)= lim Hk^rAfiy ■ ■ Jk){x) (1.1) 

r^O,R^oo 

where Hk^r,R is the truncated fc-linear Hilbert transform 

r ri-h 

HkxAh, ■ ■ ■ Jk){x) ■.= \ fi{x + t) ...fk{x + kt) (1.2) 

Jr^|t|^R k 

The operator Hi is the classical Hilbert transform, which as is well known (see e.g. 
m) is bounded on Lp(R) for every 1 < p < co. The operator H 2 is the bilinear 
Hilbert transform; it was shown by Lacey and Thiele mu using time-frequency analysis 
techniques that H2 maps L^i(M) x to H(R) whenever 1 < p,pi,P2 < and 

A + T = L in fact, they were able to relax the constraint p>ltop>2/3, however 
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in this paper it will be convenient to restrict0 to the “Banach space case” when all 
exponents are greater than 1. The same argument shows the corresponding bounds for 
H 2 ,r,R that are uniform in r, R] that is to say, one has 

11-^2,r,_ r(/ij/2)||lp(R) < C'p,pi,p2 II/l II (R) II/2IILP2 (R) 

whenever 1 < p,pi,p 2 < oo, ^ ^ /i e Lp^{R), /a e Lp^{R), and 0 < r < i?, 

where is a quantity independent of r, R. Note that the condition ^ ^ = p is 

necessary from dimensional analysis (or scaling) considerations. 

From these facts, one may make the following conjecture. 

Conjecture 1.1. Let ^ 1 and 1 < pi,... ,pk,p < oo be such that p = + ''' + 

Then one has 

\\Hk,r,R{fl, ■ ■ ■ , /fc)llLP(R) ^ C'fc,p,pi,...,pfe||/l||LPi(R) • • • ||/fc||LPfe(R) (1-3) 

whenever 1 < p,pi, ...,pk<cc, j- + ■ ■ ■ + j- = 1, LPi{R) for i = and 

0 < r < R, where Ck,p,pi^...,pt. ® quantity independent ofr, R. In particular, from fll.lj) 

and Patou’s lemma we have 

\\Hk{fl, ■ ■ ■ , /fc)llLP(R) < Ck,p iPiv.Pfcll/l||TPi(R) • ■ ■ ||/fc||LPfe(R) (1-4) 

for all test functions fi,..., fk ■ R ^ C. 

As mentioned above, this conjecture is established for k = 1,2, but is completely open 
for larger values of k. For instance, in the case A: = 3 of the trilinear Hilbert transform 
i/ 3 , no IP bounds whatsoever are known. Although it is not needed to motivate our main 
results, we also remark that the implication of fll.ip from fll.dp can be reversed (with 
some loss in the multiplicative constant); if fll.ip holds, then by restricting /i, ...,/*, to 
intervals of length R, applying fll.ip to these restrictions, and averaging over all such 
intervals (using Minkowski’s inequality and Holder’s inequality to estimate some error 
terms) it is not difficult to show that 

II lini ^A:,r,R(/l, • • • ,/fc)||LP(R) < C'^,p,pi,...,pJ\fl\\Ln{v:) ■ • • \\fk\\LPk{R) 

for some constant and test functions fi,..., fk, and then on subtracting this 

bound for two different choices of R, r and using a limiting argument we obtain 01.31) 
(with a slightly worse constant). We leave the details to the interested reader. 

One can approach Conjecture ll.ll bv introducing the operator norm Ck,p,pi,...^p,.{R/r) of 
Hk,r,Ri defined as the best constant for which one has 

\Hk,r,R{fl-, • • • , /fc)llLP(R) ^ C'fc,p,pi,...,pfc(-RA)||/l||LPi(R) • • • ||/fc||(R) 

for all /i e Lp^{R), ..., /^ e LP''(R). Note from scaling that the operator norm of IIk,\r,\R 
is the same as that of IIk^r,R for any A > 0, which is why we write the operator norm 
Ck,p,pi,...,pk{R/p) ^ function of the ratio R/r rather than of R, r separately. Conjecture 

II.Il ls then equivalent to the assertion that Ck,p,pi,...,pf,{R/r) remains bounded in the limit 

^See also the negative results of [2] showing that i ?3 can be unbounded for certain values of p below 


1. 
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Rjr 00. On the other hand, from fll. 2 p . Minkowski’s integral inequality and Holder’s 
inequality we have the trivial bound 




dt 


1 R 

= 21 og—. 


Our main result is the following slight improvement of the trivial bound. 


Theorem 1.2 (Improvement over trivial bound). Let k ^ 1 and 1 < pi,... jPk,P < oo 
be such that ^ ^ + ’ ’' + let e > 0. Then, if R/r is sufficiently large depending 

on e, k,pi,... ,pk,p, one has 

R 

Ck,p,p,,...,p^{R/r) ^ £log-. 


This falls well short of Conjecture 11.11 but it does show that some cancellation is oc¬ 
curring in the A:-hnear Hilbert transform. 

A novel featur^ in our arguments is the introduction of tools from arithmetic combi¬ 
natorics, particularly the theory of higher degree Gowers uniformity that was initially 
developed in to provide a new proof of Szemeredi’s theorem [H] on arithmetic 

progressions. Such tools are known to be useful for controlling expressions such as 

2 fo{x)fi{x + t)... fk{x + kt) 

X,tG'lA 

for various bounded, compactly supported functions /o,..., /^ : Z ^ C, so it is not so 
surprising in retrospect that they should also be able to say something non-trivial about 
integral expressions such as fll.2p . Unfortunately, at the current state of development of 
the theory of higher degree uniformity, the quantitative bounds arising from these tools 
are quite poor for k ^ 3 (with no explicit bounds whatsoever in the current literature for 
/c ^ 5), and so one would need a signihcant quantitative strengthening of the arithmetic 
combinatorics results, or the introduction of additional techniques, if one were to hope 
to make substantial progress towards Conjecture 11.11 bevond Theorem 11.21 

Roughly speaking, the strategy of proof of Theorem 11.21 is as follows. After some 
reductions reminiscent of those in [HE], as well as a discretisation step in which one 
replaces the real line M with the integers Z, one reduces matters to establishing a “tree 
estimate” in which one demonstrates non-trivial cancellation in expressions roughly of 
the form 

Yj fo{x)fi{x + t)... fk{x + kt)ij{t/2'^)<p{2~'^x - j) (1.5) 

for some smooth compactly supported functions ' 0 ,(p, some parameters n,j, and some 
bounded functions /q, ..., /^ : Z ^ R. Crucially, the function f will be odd, reflecting 
the odd nature of the Hilbert kernel y appearing in fll.ip . A standard “generalised 
von Neumann theorem” from arithmetic combinatorics tells us that expressions of the 

^See also [T] and [7] for previous appearances of methods from arithmetic combinatorics in bounding 
multilinear operators related to H^. 
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form fll.Sp are negligible if at least one of the fnnctions /j is very small in a certain 
Gowers nniformity norm. We then apply an arithmetic regularity lemma from [1] that 
asserts, ronghly speaking, that any bonnded fnnction /j can be approximated (np to 
errors small in Gowers nniformity norm, pins an additional error small in an sense) 
with a special type of function, namely an irrational virtual nilsequence. This effectively 
allows one to replace all the functions fo, ■ ■ ■, fk in fll.hp with such nilsequences. The 
point of this reduction is that irrational nilsequences enjoy a counting lemma (also from 
0) that allows one to obtain good asymptotics for expressions such as fll.Sp . At this 
point, the fact that if: is odd ensures that the main term in those asymptotics vanish, 
and the surviving error terms turn out to be small enough to eventually obtain the 
required conclusion in Theorem 11.21 


1.1. Acknowledgments. The author is supported by NSF grant DMS-1266164 and by 
a Simons Investigator Award. He thanks Giprian Demeter and Vjeko Kovac for helpful 
comments, and Pavel Zorin-Kranich for corrections. 


1.2. Notation. We use the asymptotic notation X < Y, Y > X, oi X = 0{Y) to 
denote the assertion that |A| < CY for some absolute constant C, which we call the 
implied constant. We will sometimes need to allow the implied constant to depend 
on additional parameters, in which case we indicate this by subscripts, e.g. X <s Y, 
Y Y, OT X = Os(Y) denotes the assertion that |X| < CsY for some Cs depending 
on 6. For brevity we will sometimes £x some basic parameters (e.g. k) and allow all 
implied constants to depend on such parameters (so that, for instance, X = Os(Y) is 
now short for X = Os^kiX))- We also write X--YioiX<Y<X. 

We also use the asymptotic notation X = 0n^co(X) to denote the assertion |X| < c{N)Y 
where c{N) is a quantity depending on a parameter N that goes to zero as N goes to 
inhnity. Again, if we need c{N) to depend on external parameters, we will indicate this 
by subscripts; for instance, X = ON^oD-,k(X) denotes the assertion that |X| < Ck{N)Y 
where Ck{N) goes to zero as N ^ co for each fixed choice of k. 


2. Initial reductions 

We begin the proof of Theorem 11.21 For technical reasons (having to do with the fact 
that the arithmetic regularity and counting lemmas in the literature are phrased in a 
discrete setting rather than a continuous one) we will need to transfer Theorem 11.21 
from the reals M to the integers Z, giving up the scale invariance of the problem in the 
process. Namely, we will derive Theorem 11.21 from the following discrete version of that 
theorem. 

Theorem 2.1 (Discrete version of main theorem). Let k ^ 1 and 1 < pi,... ,pk,p < oo 
be such that p = ^ ‘ o,nd let e > 0. Then, if R ^ r ^ 1 and R/r is sufficiently 
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large depending on e, k,pi,... ,pk,p, one has 




/i(x + t)... fkjx + kt) 
t 



m (z) 


for all fi e i = 1,.. .,/c, where the ff’ norm on the left-hand side is in the x 

variable. 


Let us assume Theorem 12.11 for the moment and see how it implies Theorem 11.21 We 
will use a standard transference argument. Let k,pi,... ,Pk,P-, £, R, r be as in Theorem 
oi with R/r assumed large enough depending on pi,... ,pk,p,e. Let A > 0 be a large 
quantity (depending on k,pi,... ,p, e, R,r) to be chosen later. For A large enough, we 
have XR ^ Xr ^ 1, and so by Theorem 12. II we have 


L 


fi{x + t). ..fkjx + kt) 
t 



for all fi e In particular, given fi e L^*(R) for i = 1,... ,k, we have 


L 

tGZ:Ar^|t|^Ai? 


/i(x + t + 6)... fk{x + kt + 6) 
t 


^ £ 


R 




X + 


ipi^ 


i=l xeZ 


1 /Pi 


for any 0 < 0 ^ 1. Averaging (in LX) over all such 9 and using Holder’s inequality and 
Fubini’s theorem, we conclude that 


L 

tGZ:Ar^|t|^Ai? 

Rescaling by A, we conclude that 


fi{x + t) ... fk{x + kt) 




R 




i I LRi ( 


LP{R) 


2=1 




fi{x + t)... fk{x + kt) 




^ £ 


R 


log-Hi/. 


i\LPi( 


LP(R) 


i=l 


Sending A —>■ oo and using Riemann integrability and Fatou’s lemma, we conclude that 

k 


I 




dt 

fi{x + f)... fk{x + kt) -J 


^ £ 


logi^ni/' 


LP{R) 


i I LPi ( 


2 = 1 


if the fi are continuous and compactly supported. Applying a limiting argument, we 
obtain Theorem 11.21 


Remark 2.2. In the converse direction, one can derive Theorem 12.11 from Theorem 11.21 
by applying the latter to functions of the form x i—» /j([xj); we leave the details to the 
interested reader. 


It remains to establish Theorem IZI We will perform a number of preliminary reduc¬ 
tions analogous to those in 0 , 0 , namely a reduction to a dyadic restricted weak-type 
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estimate, and the construction of various “trees” of dyadic intervals, with the key non¬ 
trivial input being a tree estimate (see Proposition 12.41 below) that improves over the 
trivial bound coming from the triangle inequality. 


We turn to the details. By duality, it will suffice to show that 


tGX:r^\t\^R xsIj 


fo{x)fi{x + t)... fk{x + {k- l)t) 


R 




iUPi( 


i=0 


whenever k ^ 1, e > 0, 1 < po,... ,Pk < oo are such that ^ -t- • • • + ^ = 1, /j e 
L^*(R) for z = 0,..., /c, i? ^ r ^ 1, and R/r is sufficiently large depending on e. By 
multilinear interpolation (and modifying e,pQ,... ,pk as necessary), we may replace the 
strong Lebesgue norms here by the Lorentz norms £^*’^(Z), and then by convexity 

we may reduce to the case where each of the /j are indicator functions, thus it will suffice 
to show that 


^ ^Eo{x)IeA^ + t).. .lE^jx + {k - l)t) ^ ("2 1) 

te'L:r^\t\^Rxe7^ ^ ^ i=0 

whenever e > 0, 1 < po, • • •; Pfc < oo are such that ^ -t- ■ ■ ■ -f ^ = 1, i?* are subsets of Z 
with finite cardinality \Ei\, and R ^ r ^ 1 with R/r is sufficiently large depending on 
£. Here of course 1e denotes the indicator function of E. 


Henceforth k,po,... ,Pk will be hxed, and all implied constants will be allowed to depend 
on these parameters. From Hblder’s inequality we have 

Y^Eo{x)lEAx + t) ■■■ lEki^ + {k — l)t) dx ^ min \Ei\ 
for any t, and thus 


teI,-.rsi\t\siRxeI. 


1eo{x)1ei{x + t) ... 1 ei,{x + {k- l)t) 

t 


. R 

^ log — mm lEA. 
r 


Comparing this with fl2.ip . we see that we are done unless 

\Ei\ |i?o| (2-2) 

for all 0 < z ^ A:. Henceforth we will assume that fl2.2p holds. It will now suffice (after 
adjusting e if necessary) to show that 


xel. 

if R/r is sufficiently large depending on e. 


Ieo{x)Iei{x + t)... lEk{x + {k- l)t) 


< e\og—\Eo 


(2.3) 


The next step is a decomposition into dyadic intervals. Let z/^ : M ^ R be a fixed 
smooth odd function supported on [—2, —1/2] u [1/2, 2] with the property that 

2] 2-X2-”() = i 
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for all t ¥= 0; such a function can be constructed by taking 'tp{t) := some 

smooth even 0 : M ^ M supported on [—1,1] and equaling 1 on [~l/2,1/2]. Henceforth 
we allow implied constants to depend on ip. Then the function differs from 

2 jnrs;2'is:i? o^ly when t ~ i? or f ~ r, where both functions are 0{1/R) 

and 0(l/r) respectively. From this and the triangle inequality one sees that fl2.3p is 
equivalent to 


2 2 ” 2 1eo{x)1ei{x + t).. .1 e^{x + kt)'ilj{t/T) 


< elog—/So 

r 


since the left-hand side here differs from that of 02.31) by 0(1), which is acceptable if 
R/r is large enough depending on e. By the triangle inequality, it thus suffices to show 
that 




n:rsS2"iSR 


lEo(a^)l£;i(3; + ^) • • • l£;j,(x -F kt)'tp{t/T^) dxdt 


R 

< elog—|F; o| 
r 


We now introduce a further smooth function : M ^ M supported on [—1,1] such that 

Yj - /) = 1 

jsZ 

for all a: e M; indeed one can take (p(x) := rj{x) — r]{x + 1) for some smooth r; : M ^ M 
equal to 1 for negative x and 0 for a: > 1. We allow implied constants to depend on ip. 
For eachU (discrete) dyadic interval / = {x e Z : j2"' < x ^ {j + 1)2"} with n ^ 0, we 
dehne the quantity 


ai := 2 - 2 " 


Y Ieo{x)Iei{x+ t).. .lE^ix + {k - l)t)'ilj(t/2'^)(p{2 '^x - j) dxdt 


(2.4) 


SO by the triangle inequality it suffices to show that 


Y ^ ^log 


R. 


\E. 


(2.5) 


where the sum is over dyadic intervals / of length between r and R. 


From the triangle inequality and fl2.4p we have the bound 

ai < 1 ( 2 . 6 ) 

for all /. We also have the following estimate: 

Lemma 2.3. ITe have 

Y ^e,p'^og^\Eo\ 

for any 1 < p < oo. 


^As we are working on the integers, we do not consider dyadic intervals of length less than 1. 
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Proof. We bound 

ai < 2“^” I I 1eo{x)1e:^{x + 1) dxdt 

Jx=(j+ 0 ( 1 )) 2 " 

^ f [ li5o(a;) ( [ ^eAv) dv) 

\Jx={j+0{l))2'^ / V2y=(i+0(1))2'» / 

< inf M 1 ^ 3 ( 0 :) Ml£;^(a;) 

xel 

where Mf{x) := sup^^.]^ 2 r^2j;sz |y-x|^r \fi.y)\ Hardy-Littlewood maximal oper¬ 

ator on the integers Z. Thus 

af |J| < 

xel 

and so (since each x e Z is contained in 0(log ■^) dyadic intervals / with r ^ |/| ^ R) 

Y, af\I\ < [log 2 M IeA^Y'^ M IeA^Y'^ 

and the claim follows from the Cauchy-Schwarz inequality, the Hardy-Littlewood max¬ 
imal inequality, and fl 2 . 2 p . □ 


From the above lemma with p = 3/2 (say) we see in particular that 

Y a/|/| <£ h^/'^log^|£^o| 

so to prove fl2.5p it suffices by (12.61) to show that 

Y \d\ ^ £'^og^\Eo\ (2.7) 

I\r^\I\^R;aj>5 

for any 5 > 0, whenever R/r is sufficiently large depending on e, 6. 

In the next section, we will establish the following result. 

Proposition 2.4 (Cancellation in a tree). Let Iq be a dyadic interval, and let 5 > 0. 
Then there exists a quantity 1 < H <<5 1 (which can depend on Iq) such that 

Y a/lJl < 5|/o| logH. 


The point here, of course, is the gain of 5 on the right-hand side, since otherwise the 
claim is immediate from the trivial bound ( 12 . 6 p . 

Let us assume this proposition for the moment and see how to conclude ( 12 .7p . Call a 
dyadic interval / bad if r < |/| ^ 7? and a/ > 6. From the proof of Lemma 12.31 (bounding 
MIe, crudely by 1 ) we see that 

M1e„(x)»5 

whenever x lies in a bad interval. In particular, from the Hardy-Littlewood maximal 
inequality we see that there are only hnitely many bad intervals. 
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Let X denote the collection of bad dyadic intervals. Using a greedy algorithm (starting 
with the largest bad intervals and only moving on to the smaller bad intervals once all 
the largest onces have been covered), as well as Proposition 12.41 fwith 5 replaced by 
we may cover X by a family T of disjoint “trees” T, each of the form T = {I : I cz 
It', \It\/At ^ |/| ^ |/t|} for some “tree top” It ^ X and some qnantity 1 ^ At <<£,5 1, 
with the property that 

Yj ai\I\ < e5\lT\log At- 

leT 

Note that we only reqnire that the top It of the tree T he in X; the other elements of 
T may lie ontside X. 

Snmming over all trees T e T, we conclude that 

L ui s 2 £51 Jr I log At- 
I:r^\I\^R-,ai>S TeT 

On the other hand, we have 

\It\ log At « ^ 1 /( t ) dx 

7gT 

for each tree TeT, and thus (by the disjointness of the trees X) 

Y Yj ^li^) dx. 

For each x in the support of S/gUj, have x e T for some tree top It, and 

thus Ml^p(a;) » 6. By the Hardy-Littlewood maximal inequality, we thus see that x is 
contained in a set of measure 0{\Eo\/d). Finally, by construction, every interval J in a 
tree TeT has size at most R and at least r/A t ^e,s f, cind so each x is contained in 
at most 0(log intervals if R/r is sufficiently large depending on e, 6. Putting all this 
together we obtain fl2.7p as required. 

It remains to establish Proposition 12.41 This will be accomplished in the next section. 

3. Applying the arithmetic regularity and counting lemmas 

By translation we may assume that 

Io = {l,...,N}=: [A] 

for some natural number N which is a power of 2. Our task is to find 1 < A <<5 1 such 
that 

N/2"-l 

Y Y 2"” 2 li?o(x)lE,(a; + f)...lE,(x + (/c-l)f)7/;(t/2’^)(p(2-^x-i) 

n:N/As;2"^N i=0 x,teZ 

< 6N log A. 

(3.1) 

We will in fact produce an A with A ^ Cs, where is a sufficiently large quantity 
depending on 5. We may assume that N is sufficiently large depending on 5, since 
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otherwise we can use the trivial bound of O(A^logA^) on the left-hand side (coming 
from the fact that there are only O(logiV) choices for n) to conclude, after choosing A 
large enough. 


Let El ■.= Ei r\ [A^]. For each n with N/A ^ 2^ ^ N, we may replace the Et by i?' 
in fl3.ip for all but 0(1) choices of j (coming from those j near 0 or iV/2”'). The total 
error in replacing Ei by i?' is thus 

2 0{2-'^2^^) = 0{N) 

which is acceptable since A ^ Cs for some large Cs- It thus suffices to show that 


Af/ 2 "-l 

L L 2 - 

n:A/A!£ 2 "^Af j=0 


< 6 N log A 


2 Ie',{x)Ie{{x + t).. .1 e'^{x + kt)'ilj{t/2^)ip{2 ”x-j) 

X^tG'Z 


(3.2) 


for some O^ ^ A <<5 1 . 


To control this expression we recall the Gowers uniformity norms from lain]. If /: 
Z/A^Z —> C is a function on a cyclic group ^LlN^L and d ^ 1, we dehne the Gowers 
uniformity norm ||/||{/d(z/ 7 vz) by the formula 


1 / 2 '^ 


ll/ll;7'^(Z/AZ) — 


jV '^+1 




A 


hi 


^hj{x) 


,...,,xeZ/7VZ 


where Ahf{x) := f{x + h)f{x). Given instead a function / : [N] ^ C on the interval 
[N], we define the Gowers norm ||/||i 7 d([^]) by the formula 

\\f\\u'i{[N]) = \\f\\u‘^{Z/N'Z)/\mN]\\u<i{Z/N'Z) 

for any N' ^ 2‘^N, where we embed [A] into 'LjN'Jj and extend / by zero outside of 
[A]; it is easy to see that this definition does not depend on the choice of A. We have 
the generalised von Neumann theorem 


1 


2 + t)... fk{x + kt) 

x^tsTj 




(3.3) 


for any fo,..., fk : [A] ^ C bounded in magnitude by 1 (extending by zero outside of 
[A]); see e.g. [I2l Lemma 11.4] (after applying the embedding of [A] into some suitable 
Z/A'Z). We have the following variant: 


Lemma 3.1. Let d := max(/i:,2). Then for any n with 2"' < N, any j ^ T, and any 
functions fo,..., fk : [A] ^ C bounded in magnitude by 1 (and extended by zero outside 
of [A], we have 


1 

iW 


Yj /o(^)/i(a^ + t)... fk{x + kt) 2 p{t/ 2 ^)i.p {2 ^x - j) 
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In particular, by the triangle inequality we have 


N/2”-l 

S S 2 -” 

j=0 


2 + t)... fk{x + kt)i){t/2'^)(p{2 - j) 

3^,tGZ 


(3.4) 




Proof. By adding a dummy function if necessary we may assume that k ^ 2, so 
that d = k. By Fourier inversion we may write 

^jJ{t) = r du 

Jr 


and 

(^(x) = r e2™^(^(r;) dv 

JR 


for some rapidly decreasing (and in particular, absolutely integrable) functions V’) ^ • 
M ^ C. By the triangle inequality, it thus suffices to show that 


1 


2 Ux)fi{x + t)... Mx + 

X^tG'Z 




uniformly for all m, n e R. Writing 

fo{x)fi{x + x f^{x + t)e2"("+*) 

and applying fl3.3p . we see that 


1 

iW 


J] fo{x)Mx + t)... Mx + 

X^tG'Z 


< 


inf 

O^i^k 


fi\\u>^{[N]) 


where fi is fi modulated by a Fourier character x gSTna;?; some t e R. But if A: ^ 2, 
one easily verihes that fi has the same norm as fi (because Ahj^Ah^fi = Ah^^Ah^fi 
for any hi, h 2 ), and the claim follows. □ 


We also need a similar statement in which the norm is replaced by the If norm: 
Lemma 3.2. For any fo, ■ ■ ■, fk ■ [-^] ^ C hounded in magnitude by 1, we have 


N/2^-1 

S S 2 - 

n:N/A!£2"^iV i=0 


2 + t)...fk{x + kt)ip{t/2'^)ip{2 - j) 

x^IgIa 


< A^^/2(log44) inf ||/i||^ 2 ([^]). 


Proof. Let 0 < z < /c. Observe that for each integer y and each n with N/A ^ 2^ ^ N, 
there are at most 0(1) choices of j for which the sum Yjx isz fo{x)fi{x + t)... fk{x + 
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kt)'ijj{t/2"')ip{2 ‘^x — j) contains a non-zero term involving fi{y), and when this is the 
case there are 0(2"') snch terms. From this and the triangle inequality we see that 

n-.N/A^2^^N i=0 x,teZ 

< {\ogA)Y,\fiiy)\ 

yeZ 

and the claim now follows from the Cauchy-Schwarz inequality. □ 

The above lemmas show, roughly speaking, that we may freely modify each of the 1e' by 
errors that are either small in (normalised) i‘^{[N]) norm, or extremely small in f/'^([A^]) 
norm. To exploit this phenomenon, we will need the arithmetic regularity lemma from 
|1] that asserts that all bounded functions on [A^], up to errors of the above form, can 
be expressed as a very well distributed nilsequence. To make this statement precise, we 
need to recall a large number of definitions from |1] (although for the purposes of this 
paper, many of the concepts defined here can be taken as “black boxes”). 

Definition 3.3 (Filtered nilmanifold). Let s ^ 1 be an integer. A filtered nilmanifold 
G/T = {G/T,G,) of degree < s consists of the following data: 

(1) A connected, simply-connected nilpotent Lie group G; 

(2) A discrete, cocompact subgroup F of (7 (thus the quotient space G/T is a compact 
manifold, known as a nilmanifold)] 

(3) A filtration G, = (Gp))® g of closed connected subgroups 

G = G(o) = G(i) ^ G{2) ^ ... 

of G, which are rational in the sense that the subgroups F(j) := F n G(i) are 
cocompact in Gp), such that [Gp),G(j)] c Gp+j) for all i,j ^ 0, and such that 
G(i) = {id} whenever i > s] 

(4) A Mal’cev basis X = (Ai,..., Xdim(G')) adapted to G,, that is to say a basis 
Ai,..., Adijn(G') of the Lie algebra of G that exponentiates to elements of F, 
such that Xj, ... , Adim(G) span a Lie algebra ideal for all j ^ i ^ dim(G), and 
^dim(G)-dim(G(g)+i, ■ ■ ■, A:dim(G) spaus the Lie algebra of Gp) for all 1 ^ z ^ s. 

One may use a Mal’cev basis to define a metric da/r on the nilmanifold G/F, as per Rl 
DeBnition 2,2], 

Definition 3.4 (Complexity). Let M ^ 1. We say that a filtered nilmanifold G/F = 
(G/F, G.) has complexity < M if the dimension of G, the degree of G,, and the ratio¬ 
nality of the Mal’cev basis X (cf. [3l Definition 2.4]) are bounded by M. 

Definition 3.5 (Polynomial sequence). Let (G/F,G,) be a filtered nilmanifold, with 
hltration G, = (G(i))® g. A polynomial sequence adapted to this hltered nilmanifold is 
a sequence g : Z G with the property that 

dh^... dh,g{n) e Gp) 

for all z ^ 0 and hi,..., hi,n e Z, where dhg{n) := g{ri + h)g{n)~^ is the derivative of g 
with respect to the shift h. 
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Definition 3.6 (Orbits). Let s ^ 1 be an integer, and let M,A > 0 be parameters. A 
polynomial orbit of degree < s and complexity < M is any fnnction n i—> g{n)T from 
Z ^ G/r, where (G/L, G,) is a filtered nilmanifold of complexity < M, and g : Tj ^ G 
is a polynomial seqnence. 

Definition 3.7 (Nilseqnences). A {polynomial) nilsequence of degree < s and complex¬ 
ity < M is any fnnction / : Z ^ C of the form f{n) = F{g{n)T), where n i—» g{n)T is 
a polynomial orbit of degree ^ s and complexity ^ M, and F : G/L ^ C is a fnnction 
of Lipschitz norm@ at most M. 

Definition 3.8 (Virtnal nilseqnences). Let A^ ^ 1. A virtual nilsequence of degree 
< s and complexity < M at scale N is any fnnction / : [A^] ^ C of the form f{n) = 
F{g{n)r,n mod q,n/N), where 1 < g < M is an integer, n i—> g{n)T is a polynomial 
orbit of degree < s and complexity < M, and F : G/L x Z/gZ x R ^ C is a fnnction of 
Lipschitz norm at most M. (Here we place a metric on G/L x Z/gZ x R in some arbitrary 
fashion, e.g. by embedding Z/gZ in R/Z and taking the direct snm of the metrics on 
the three factors.) We define a vector valued virtual nilsequence f : [A^] ^ similarly, 
except that F now takes valnes in C'^ instead of C. 


We now have almost all the definitions needed to state the arithmetic regnlarity lemma: 

Theorem 3.9 (Arithmetic regnlarity lemma). Let f : [A^] ^ [0,1]'^ be a function 
for some d ^ 0, let s ^ 1 be an integer, let e > 0, and let F : R'^ ^ R^ be a 
monotone increasing function with F{M) ^ M for all M. Then there exists a quantity 
M = Os,e,T,d{^) o,nd a decomposition 

f fnil fsml F fnnl 

of f into functions /nu, /smi, /unf : [A^] [-1,1]'^ of the following form: 


(1) {fni\,i structured) f-^n is a {F{M), N)-irrational vector-valued virtual nilsequence 
of degree ^ s, complexity ^ M, and scale N, where the notion of {A,N)- 
irrationality is defined in [U Definition A.6]; 

(2) (/smi small) Each of the d components of /smi has an i‘^{[N]) norm of at most 
eAi/2; 

(3) (/unf vory uniform) Each of the d components of f^ni has a f/®'''^([A^]) norm of 
at most 1/F{M); 

(4) {Nonnegativity) f^a and fnn + /smi take values in [0,1]'^. 


Proof. See [H Theorem 1.2]. Strictly speaking, this theorem is only stated in the scalar 
case d = 1, bnt the same argument extends without difficulty to the vector-valued case 
d ^ 1 (note we allow our bound on M to depend on d). We remark that the bounds 
on M in the above theorem are extremely poor (at least tower-exponential or worse, in 


^The (inhomogeneous) Lipschitz norm || J^lhip of a function F : AT C on a metric space X = {X, d) 
is defined as 


||F||Lip := sup |F(a:)| + sup 

xeX x,yEX\x^y 


\F{x)-F{y)\ 

\x-y\ 
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practice) for a variety of reasons, including the lack of good (or indeed any) quantitative 
bounds for the inverse theorem for higher order Gowers uniformity norms. □ 


We apply this lemma with e replaced by s + 1 replaced by d, d replaced by fc + 1, 
and a rapidly increasing function to be chosen later, to obtain decompositions 

fm\,i “I” /smlji ”1“ funf,i 

with /nil,*, /smi,*, /unf,i : [N] ^ [-1,1] being the components of functions /nii, fsmi, /unf : 
[A^] ^ [—1,1]^+^ obeying the conclusions of Theorem 13.91 By the triangle inequality, 
the left-hand side of 03.21) can be written as the sum of = 0(1) terms, in which each 
of the 1e'. has been replaced by one of /nii,*, /smi,i, /unf,n By Lemma IXTl the contribution 
of any term involving one of the /unf,* is at most Oa{N/J^{M)), while from Lemma [221 
the contribution of any term involving one of the /smi,* is 0{6N log A). Thus, one may 
bound the left-hand side of 03.21) by 


n:N/Asi2"s:N 




+ Oa{N/T{M)) + 0{SN log A) 


where Xn is the quantity 


Xn: = 


N/2^-1 

i=o 


^ /nil,o(a;)/nil,l(a: + t) ... fnil,k{x + kt)'ijj{t/2'^)ip{2 - j) 

3:,/gZ 


We now turn to the estimation of Xn. Bounding each /nii,* by 0(1), we have the trivial 
bound 

Xn<N (3.5) 

which we will use for values of 2"' that are close to N. For the remaining values of n, 
we argue as follows. By Dehnition 13.81 we have 

/nii,*(T -I- it) = Fi ^g{x + it)T, x + it mod q, 

whenever 0 ^ i ^ k and x + it e [A^], for some positive integer q = Om( 1), some 
(J^(M), A^)-irrational polynomial orbit n <-->■ g{n)r of degree < d — 1 and complexity 
Om( 1) into a hltered nilmanifold G/V of degree ^ d — 1 and complexity Om( 1), and a 
function F* : G/T x Z/gZ x R ^ C of Lipschitz norm at most Om(1)- 


x + it\^ 

) 


For all but 0(1) values of 0 ^ j ^ N/2^ — 1, one has x, x + t,..., x + kt e [A^] in the 
support of '0(f/2"')(p(2“"x — j). The exceptional values of j contribute 0(2”) to Xn, 
thus 


Xn « 


V/2"-l 

L 

i=o 


2j fl ( d(a^ + x + it mod q, ) /^(t/2”)(p(2 ”x - j) 

x,teZ *=0 '' 


+ 2 ”. 


By the triangle inequality, we thus have 


« 2-2”AT 


sn Fi{g{x + it)r, x + it mod 


x,tGZ i=0 


x + it 
N 


)^(t/2”)v2(2-”x-j) 


+ 2" 
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for some 0 < j < N/2^. Splitting x,t into residue classes modulo q = OuiX) and using 
the triangle inequality, we thus have 


X, «M 2-2-iV 


Yj n + it)T, a + ib mod q, - j) 

x^ts'L\x=a mod q,t=b mod q 2=0 


+2" 


for some 0 ^ a,b < q. Next, on the support of ';/t(t/2”)(p(2 "‘x — j) we have = 
^ + O(^), hence by the Lipschitz property 

X zt 

Fi{g{x + it)T, a + ib mod q, —+ *0^) + Om 



where FI = : G/V 


is the function 


i‘F 

F-{x) := Fi [ X, a + ib mod q, ) . 


Note that F' has a Lipschitz norm of Om( 1)- We conclude that 

k 


Xn «M 2“2"X 


Y + it)T)i){t/2'^)ip{2 ^x-j) 

x^tG'Z]x=a mod q^t=b mod q 2=0 


+ 2" 


Making the substitution x = qx' + a, t = qt' + 6, this becomes 

k 


X„ «u 2-2”Af 


L n Fij{g{qx' + iqt' + a + ib)T)il;{ 

x' 2^0 


qt' + a 


M2-^qx' + 2-^a- j) 


+ 2 ". 


At this point, we use the counting lemma from [H Theorem 1.11], This gives the 
asymptotic 

k 

T n F'ijigiQx' + iqt' + a + ib)T) — |/|| J|a + o^(m)^oo;m(A^^) + 07 v^oo;m(-^^) 

x'el,t'ej i =0 


for any intervals /, J c [—N, X], where a is a quantity independent of /, J (it is given by 
an explicit integral of a certain Lipschitz function on a certain hltered nilmanifold, but 
its precise value is immaterial for the current argument). Since the left-hand side of this 
asymptotic is Om(|L|| J|), we may assume without loss of generality that a = Om( 1)- A 
routine Riemann sum argument using the bound 2"' ^ N/A and the smooth nature of 
■0, if (decomposing the x', t' variables into intervals of length then shows that 


Y fl + iqt' + a + z6)r)V^( ^^ ^ ^ )ip{2 '^qx'+ 2 '"a-j) 

i=0 

= a \ ^ )ip{2~"'qx' + 2~"'a — j) dx'dt' + 0{A~^^N'^) 

Jr2 2" 

(X^) -f Oat 

—>00;M,A (X^). 

Since ip is odd, the integral here vanishes. We conclude (since 2“^"'X « A^X“^) that 
«M Ojr(^M)^oo-,M,A{N) + 0]Sf^aD-,M,AiN) + A ®°X -t- 2”. 
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Using this bound for 2” < A ^N, and the trivial bound fl3.5l) for A < n ^ N, we 
have 

^ Xn « Oj-(M)^oo;Ar,A(^) + 0 n^od-,m,a{N) + Om{A ^N) + 6N\ogA 

n:A/AsC2"^A 

and so we may bound the left-hand side of fl3.2p by 

ot{m)^cc-,m,a{X) + on^co;M,a{N) + Om{A ^N) + 0((fA^logv4). 

If we choose A sufficiently large depending on 6, M, and then X sufficiently rapidly 
growing, and then N sufficiently large depending on S,X (recalling that M = 05 _^( 1 )), 
we can make this expression 0(5A^log74), giving fl3.2|) as required. 

Remark 3.10. The above arguments also provide some non-trivial cancellation for 
(truncations of) other variants of the multilinear Hilbert transform. For instance, one 
replace Hk by the maximal truncated /c-linear Hilbert transform 

sup \Hk,r,Rf{x)\; 
r<R 

one could also consider the multilinear Hilbert transform 

r dt 

p.v. fi{x + Cit)... fk{x + Ckt) — (3.6) 

Jr ^ 

with rational coefficients Ci,..., c^; we leave the modihcation of the above arguments to 
these operators to the interested reader. Curiously, there appears to be some difficulty 
extending the arguments to the final variant fl3.6p if the ci,... ,ca; are not rationally 
commensurate, as one cannot easily discretise in this case to deploy additive combina¬ 
torics tools. A somewhat similar phenomenon appeared previously in [1]. It may be 
possible to get around this difficulty by developing a continuous version of the arith¬ 
metic regularity and counting lemmas. It is also possible, in principle at least, to obtain 
a corresponding result for the maximal multilinear Hilbert transform or by a polynomial 
Carleson type operator 

sup p.y.( Mx + t)... h{x + A:f)e2-^W ^ 

P Jr t 

where P ranges over all polynomials P : R ^ M of degree bounded by some fixed d, 
although this may require a more sophisticated counting lemma than the one given in 

111 - 
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